A New Hybrid Critic-training Method for Approximate Dynamic Programming

نویسندگان

  • Thaddeus T. Shannon
  • George G. Lendaris
چکیده

A variety of methods for developing quasi-optimal intelligent control systems using reinforcement learning techniques based on adaptive critics have appeared in recent years. This paper reviews the family of approximate dynamic programming techniques based on adaptive critic methods and introduces a new hybrid critic training method.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A comparison of training algorithms for DHP adaptive critic neurocontrol

A variety of alternate training strategies for implementing the Dual Heuristic Programming (DHP) method of approximate dynamic programming in the neuro-control context are explored. The DHP method of controller training has been successfully demonstrated by a number of authors on a variety of control problems in recent years, but no unified view of the implementation details of the method has y...

متن کامل

An Introduction to Adaptive Critic Control: A Paradigm Based on Approximate Dynamic Programming

Adaptive critic control is an advanced control technology developed for nonlinear dynamical systems in recent years. It is based on the idea of approximate dynamic programming. Dynamic programming was introduced by Bellman in the 1950’s for solving optimal control problems of nonlinear dynamical systems. Due to its high computational complexity, applications of dynamic programming have been lim...

متن کامل

Primitive Adaptive Critics

We propose a simple framework for critic-based training of recurrent neural networks and feedback controllers. We term the critics that are used primitive adaptive critics, since we represent them with the simplest possible architecture (bias weight only). We derive this framework from two main premises. The first of these is a natural similarity between a form of approximate dynamic programmin...

متن کامل

Stochastic Control Strategies and Adaptive Critic Methods

Adaptive critic methods have common roots as generalizations of dynamic programming for neural reinforcement learning approaches. Since they approximate the dynamic programming solutions, they are potentially suitable for learning in noisy, nonlinear and nonstationary environments. In this study, a novel probabilistic dual heuristic programming (DHP) based adaptive critic controller is proposed...

متن کامل

A New Framework for Advancement of Power Management Strategies in Hybrid Electric Vehicles

Power management strategies play a key role in the design process of hybrid electric vehicles. Electric Assist Control Strategy (EACS) is one of the popular power management strategies for hybrid electric vehicles (HEVs). The present investigation proposes a new framework to advance the EACS. Dynamic Programming method is applied to an HEV model in several drive cycles, and as a result, some op...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000